mlatoz

Thompson Sampling

Bayesian Inference


Thompson Sampling Algorithm

Step 1: At each round n, we consider two numbers for each ad i:

Step 2: For each ad i, we take a random draw from the distribution below:

     θi(n) = ß(N1i(n) + 1, N0i(n) + 1)

Step 3: We select the ad that has the highest θi(n).


UCB Algorithm v/s Thompson Sampling Algorithm

Upper Confidence Bound Thompson Sampling
UCB Thompson Sampling
Deterministic Probabilistic
Requires update at every round Can accommodate delayed feedback
Less empirical evidence than Thompson Sampling Better empirical evidence than UCB

Download Resources


«Previous Next»